Very low bit rate parametric audio coding

نویسنده

  • Heiko Purnhagen
چکیده

In this thesis, a parametric audio coding system for very low bit rates is presented. It is based on a generalized framework that combines different source models into a hybrid model and thereby permits flexible utilization of a broad range of source and perceptual models. The developed parametric audio coding system allows efficient coding of arbitrary audio signals at bit rates in the range of approximately 6 to 16 kbit/s. The use of a hybrid source model requires that the audio signal is being decomposed into a set of components, each of which can be adequately modeled by one of the available source models. Each component is described by a set of model parameters of its source model. The parameters of all components are quantized and coded and then conveyed as bit stream from the encoder to the decoder. In the decoder, the component signals are resynthesized according to the transmitted parameters. By combining these signals, the output signal of the parametric audio coding system is obtained. The hybrid source model developed here combines sinusoidal trajectories, harmonic tones, and noise components and includes an extension to support fast signal transients. The encoder employs robust algorithms for the automatic decomposition of the input signal into components and for the estimation of the model parameters of these components. A perceptual model in the encoder guides signal decomposition and selects the perceptually most relevant components for transmission. Advanced coding schemes exploit the statistical dependencies and properties of the quantized parameters for efficient transmission. The parametric approach facilitates extensions of the coding system that provide additional functionalities. Independent time-scaling and pitch-shifting is supported by the signal synthesis in the decoder. Bit rate scalability is achieved by transmitting the perceptually most important components in a base layer bit stream and further components in one or more enhancement layers. Error robustness for operation over error-prone transmission channels is achieved by unequal error protection and by techniques to minimize error propagation and to provide error concealment. The resulting coding system was standardized as Harmonic and Individual Lines plus Noise (HILN) parametric audio coder in the international MPEG-4 Audio standard. Listening tests show that HILN achieves an audio quality comparable to that of established transform-based audio coders at 6 and 16 kbit/s.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Low Complexity Parametric Stereo Coding in Mpeg - 4

Parametric stereo coding in combination with a State-of-the-Art coder for the underlying monaural audio signal results in the most ef cient coding scheme for stereo signals at very low bit rates available today. This paper reviews those aspects of the parametric stereo paradigm that are important for audio coding applications. A complete parametric stereo coding system is presented, which was r...

متن کامل

Advances in Parametric Audio Coding

Parametric modelling provides an efficient representation of general audio signals and is utilised in very low bit rate audio coding. It is based on the decomposition of an audio signal into components which are described by appropriate source models and represented by model parameters. Perception models are utilised in signal decomposition and model parameter coding. This paper gives a brief t...

متن کامل

Speeding up HILN – MPEG-4 Parametric Audio Encoding with Reduced Complexity

Parametric modelling permits an efficient representation of audio signals and is utilised for very low bit rate coding by the MPEG-4 Standard. Here we look at the MPEG-4 parametric audio coding tools ”Harmonic and Individual Lines plus Noise” (HILN) which are based on a decomposition of the audio signal into components that are described by appropriate source models and represented by model par...

متن کامل

Error Protection and Concealment for HILN MPEG-4 Parametric Audio Coding

The HILN (Harmonic and Individual Lines plus Noise) MPEG-4 parametric audio coding tool allows efficient representation of general audio signals at very low bit rates. Therefore possible applications include transmission over IP or wireless channels which are both characterised by specific transmission error models. On the other hand, since parametric audio coding is a relatively new technique ...

متن کامل

Calculation of an entropy-constrained quantizer for exponentially damped sinudoids parameters

The Exponentially Damped Sinusoids (EDS) model can efficiently represent real-world audio signals. In the context of low bit rate parametric audio coding, the EDS model could bring a significant improvement over classical sinusoidal models. The inclusion of an additional damping parameter calls for a specific quantization scheme. In this report, we describe a new jointscalar quantization scheme...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008